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We apply the computational methodology of phase re- 
trieval to the problem of folding heteropolymers. The 
ground state fold of the polymer is defined by the intersec- 
tion of two sets in the configuration space of its constituent 
monomers: a geometrical chain constraint and a threshold 
constraint on the contact energy. A dynamical system is 
then defined in terms of the projections to these constraint 
sets, such that its fixed points solve the set intersection prob- 
lem. We present results for two off-lattice HP models: one 
with only rotameric degrees of freedom, and one proposed 
by Stillinger et al.\\\ with flexible bond angles. Our phase 
retrieval inspired algorithm is competitive with more estab- 
lished algorithms and even finds lower energy folds for one 
of the longer polymer chains. 

A favorite metaphor in the field of nonlinear optimization, 
and computational protein folding in particular, is the en- 
ergy landscape. Energy landscapes have been compared to 
funnels|2|, golf-courses|3|, and are generally held responsible 
for all the behavior observed in nature, as well as the challenges 
faced by simulators. Kinetics simulations are, by their very na- 
ture, tied to the topography of the energy landscape and cannot 
avoid scaling its barriers and languishing in its manifold min- 
ima. The outlook for native fold discovery, however, is more 
optimistic. As we show below, for this problem there are op- 
tions that escape the confines of the energy landscape and yield 
significant computational dividends. 

Most native fold search strategies are conservative in at least 
two respects. First, the search is carried out in the same space 
accessed by the physical degrees of freedom of the protein. Sec- 
ond, the search in this space is carried out quasi-locally, in the 
sense that every conformation examined is derived from a pre- 
viously considered conformation by a local modification. There 
are alternatives to these general guidelines that have proven ef- 
fective in other fields. For inspiration we turn to the classic 
problem of phase retrieval. 

The naive search space in phase retrieval is superficially 
equivalent to the space of rotamer configurations, each un- 
known phase angle <j> corresponding to a dihedral angle on the 
protein backbone. An important application of phase retrieval 
is the reconstruction of the electron density in a crystal, given 
its Fourier amplitudes Fq. 

p(r) =^F q cos(q-r + ( / )q ) (1) 
q 

The task of the algorithm is to find values for the phases q such 



that the resulting density satisfies certain general character- 
istics (e.g. positivity, atomicity) or constraints. To illustrate the 
idea, we consider a very simple situation where the given am- 
plitudes F q are derived from a density known to take only two 
values, say p = ±1. To implement the binary valued density 
constraint we could try minimizing a penalty function of the 
form 

V = J2(p(rf-l) 2 , (2) 

r 

where the positions r fall on a grid determined by the range over 
which the Fourier vectors q are sampled. This expression for 
V, an explicit function of the phase variables </> q , is a possible 
energy landscape for the phase retrieval problem. The correct 
phases are identified by discovering a point on the landscape 
where the energy realizes the minimum value V = 0. 

Practical phase retrieval algorithms do not minimize an ob- 
jective function as sketched above|4||5j|U. The most successful 
algorithms do not navigate the barriers and false minima of an 
energy landscape. Typically, the search performed by these al- 
gorithms is carried out in a much larger space (than the space 
of "rotamers") and the steps executed are global in character. 
The example above serves to illustrate the key elements of the 
search dynamics, called projections. There are two projections, 
both of which act on a density that has been freed of all con- 
straints. In particular, one no longer insists that p has the given 
Fourier amplitudes, that is, the form Q parametrized by phase 
angles. Instead, one uses the device of a projection Pa, which 
takes an arbitrary input density p and returns a minimal modi- 
fication of p where the given Fourier amplitudes have been re- 
stored. This can be computed efficiently, by first transforming p 
to Fourier space, making the necessary modification there, and 
then transforming back. The term "projection" is derived from 
the minimality condition, and in the case of Pa corresponds (in 
Fourier space) to mapping each complex Fourier coefficient to 
the nearest point on a circle whose radius is given by the cor- 
responding amplitude F q . The binary constraint on the density 
values is implemented by another projection, Pg, where mini- 
mality of the change calls for all positive values to be replaced 
by 1, negative values by —1. Each of the two projections ac- 
complishes something global, in effect solving half of the prob- 
lem to completion. The spectrum of modern phase retrieval al- 
gorithms arises both from the variety of the kinds of projections 
used, as well as variations in how they are combined|7|. 

Figure 1 shows successive iterates of a particular combina- 
tion of projections, called the difference map\l\, on the phase 
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Figure 1 : Difference map solution of a phase retrieval problem. 
Each horizontal row represents one iterate of the one dimen- 
sional density, with the initial density at the top and convergence 
to the binary valued solution at the bottom. 



retrieval problem for a binary valued density. The dynamics is 
deterministic and the discovery of the solution corresponds to 
the arrival at a fixed point of the map. Although the number 
of iterations required by the algorithm depends on the initial 
density, this number is always much less than the size of an 
exhaustive search. 

We show below that the projection technique can be applied 
to the protein native fold search problem, and that for simple 
off-lattice heteropolymer models the results are encouraging. 
After a brief review of the difference map scheme for combin- 
ing projections, we examine in detail the two projections that 
apply to the native fold search. We present results for two HP 
models, one with only dihedral degrees of freedom (rotamer 
model), and a model proposed by Stillinger et al.\l \ with vari- 
able bond angles (flexible chain model). For the longer chains 
the projection based algorithm was able to find lower energies 
than published results 1 8 9| obtained by methods that explore 
the energy landscape. 



Theory and Methods 

Difference map algorithm. The search space is in general a 
high dimensional Euclidean space E. Polymer conformations, 
for example, are embedded by associating three Cartesian co- 
ordinates of E with the position of each monomer in the chain. 
The goal of the algorithm is to discover one element x S AC\B, 
where A and B are subsets of E, usually having the character of 
constraints. In polymer applications, for example, set A might 
represent all monomer configurations that satisfy the chain con- 
straints (bond lengths, etc.). The constraint sets A and B are 
assumed to be simple enough that the two projections to these 
sets, Pa and Pg, can be computed efficiently. For example, to 
compute y = Pa(x), we need to find an element y 6 A that 
minimizes the distance \\y — x\\. In difference map applications 
one may relax the condition that y € A realizes the true mini- 



Figure 2: Comparison of alternating projections (left) and dif- 
ference map iterations (center) in the case of two constraint sets, 
a point and a line, that do not intersect. The alternating map 
Pa (Pb{x)) stagnates on set A; iterates of D(x) move uni- 
formly along the axis of nearest separation between A and B. 
When A and B intersect (right), every point in the space locally 
orthogonal to both constraints is a fixed point of D{x). 



mum of \\y — x\\, although this is usually easy to achieve when 
y is near enough to x that the constraint can be linearized. In 
general, the performance of the algorithm is improved by the 
distance minimizing quality of the projections. 

When the projections are combined in alternating fashion, 
x — ► Pa {Pb{x)), problems arise when there is a local mini- 
mum in the separation of the constraint sets. As shown in Fig- 
ure 2, this map will then have a fixed point x* = Pa (Pb(x*)) 
that lies in A but not B. The difference map is a more elaborate 
combination of projections given by 1 7 1 



x — > D(x) = x 
A(x)=P A (f B (x)) 



-0A(x) 
PbUa(x)) 



(3) 
(4) 



where 



f A (x) = Pa(x)-/3- 1 {Pa(x)-x) (5) 
Ib{x) = Pb(x)+/3- 1 (P b {x)~x) , (6) 

and (3 is a dimensionless parameter. At a fixed point x* = 
D(x*), we have A(x*) = and 



Pa Ub{x*)) = P B Ua{x*)) = x sol . 



(7) 



This shows that x so \ S A H B, since x so \ is in the range of 
both projections. The more straightforward definition A(x) = 
Pa(x) — Pb{x), which leads to the same conclusion, is not use- 
ful because the fixed points x* of D are then unstable. The maps 
Ja and fs are tuned to maximize the attraction of the difference 
map's fixed points| 10 1 . When confronted with a near intersec- 
tion of sets A and B, iterates of the difference map move at 
a uniform rate along the axis of nearest separation, as shown 
in Figure 2. The step size in the latter situation decreases in 
proportion to the distance between A and B, and the flow de- 
generates into a space of fixed points when A and B intersect. 

Studies of hard optimization problems, such as phase re- 
trieval, point to the following sequence of events in the differ- 
ence map solution process. Starting from an arbitrary initial 
point so G E, the iterates very quickly converge on a much 
smaller subset, a quasi-attractor Q. The dynamics on Q is 
chaotic, and Q would be a true (chaotic) attractor in an ill-posed 
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Figure 3: Spaces sampled by optimization algorithms: "re- 
tainer" space for two dihedral angles (left), difference map 
quasi-attractor (right). The dimension of the quasi-attractor is 
smaller than that of the rotamer space, even though it is embed- 
ded in a higher dimensional Euclidean space. The large point 
represents the solution. 



problem instance, when A n B is empty. Since the two projec- 
tions are in fact very insensitive to the existence of a solution, 
it follows that the dynamics in a well-posed instance is similar, 
only differing when the iterate arrives at the attractive basin of 
a fixed point and the algorithm terminates. A cartoon compar- 
ison of exhaustive "rotamer" search and difference map search 
is given in Figure 3. 

Heteropolymer models. We consider two off-lattice het- 
eropolymer models, with monomer-monomer interaction of the 
Lennard-Jones form: 

^=4 EE (^-^) ■ ^ 

i=l j=i+2 \ l 3 ij / 

N is the number of monomers, ry is the vector separation of 
monomers i and j with magnitude |ry| = r^ , and = Cji 
are constants that depend on the hydrophobic (H) and polar (P) 
character of the monomers. For the flexible chain model pro- 
posed by Stillinger et q/.fTI. 

Chh = 1 Chp = — — Cpp = - . (9) 
Another model we study, the rotamer model, has 

Chh = 1 Chp = Cpp = - . (10) 

The main difference between the flexible chain and rotamer 
models is the nature of the constraints on the polymer chain. 
In the flexible chain model only the bond length is fixed, 
ra+i — 1; in the rotamer model the bond angles are fixed as 
well: r,_i j ■ r.- L j+i = cos a. Since the latter constraint fixes the 
distances ra + 2, these terms are excluded from the sum in (jSJl 
for the rotamer model. The flexible chain model adds a bond 
angle energy favoring linear conformations: 

E ch&in = -^2(1 - Ti-n ■ r ii+1 ) . (11) 

i=2 



Constraint projections. Protein conformations are subject 
to two, typically antagonistic, constraints. In order to function, 
proteins adopt a compact shape with stability and functionality 
conferred by the three dimensional packing of its constituent 
amino acid residues. In order for the protein to be synthesized, 
however, the arrangement of the residues must also correspond 
to a possible conformation of a polypeptide chain. Either of 
these constraints would be much easier to satisfy if the other 
could be neglected, and there would then be a multitude of so- 
lutions. The difficulty in finding the native fold, from this per- 
spective, is finding a configuration of residues that satisfies both 
constraints. We discuss later how this point of view provides a 
basis for understanding the uniqueness of the native fold. 

The application of the difference map algorithm to the model 
proteins described above involves three things: specifying the 
embedding, defining the constraint sets, and computing projec- 
tions to the constraint sets. We embed both models in a Eu- 
clidean space E of dimension 3N in the standard way: three 
Cartesian coordinates for each monomer position. The con- 
straint sets A and B correspond to the chain constraints and 
packing constraints, respectively. 

Set A in the flexible chain model is the set of all monomer 
configurations in E with r; j + i = 1, while in the rotamer model 
we impose the additional constraint rj_i , • j+i = cos a (for a 
given a). The projection to A, or Pa, is computed with the aid 
of a penalty function V^hain- For the rotamer model we used 

N-l N-l 

Khain= E( r4i + 1 ~ 1 ) 2+ E( r4 - l4 ' r "+ 1 ~ COSa ) 2 • ( 12 ^ 

i=l i=2 

The flexible chain model used only the first term in (I12> . To 
compute Pa(x), given some input monomer configuration x G 
E, we use gradient descent minimization of V^aim terminated 
when the step size falls below a given threshold. The algorithm 
records the success of the projection by testing whether Khain 
is within a small tolerance value of zero. In the experiments 
reported below, the success rate for Pa was 100%. 

The packing constraint set B in the rotamer model is simply 
the set of monomer configurations x £ E satisfying E^ j (x) < 
Eo, where Eq specifies the energy depth of the search. For the 
constraint satisfaction problem to have a feasible point, or the 
difference map to have a fixed point, Eq must be greater than 
the ground state energy of the polymer. We again compute the 
corresponding projection, Pb, using gradient descent, but now 
with the function E^j. The termination criterion is also differ- 
ent, since we are only interested in crossing the E'lj (x) = Eo 
contour, rather than finding a local minimum. After cross- 
ing the target contour, we use Newton iterations to converge 
on the contour. In the event that the input x already satisfies 
Ehj(x) < Eq, the same x is returned as the output of the pro- 
jection. Crossing of the Eq contour is used as the criterion for a 
successful computation of Pg. Clearly the success rate depends 
on Eo. In our experiments the success rate for Pg was essen- 
tially 100%, since the target energy Eq is always such that find- 
ing a feasible point of Ej J j(x) < Eo is easy. This is because the 
target energies of relevance, those that apply in the dual con- 
straint problem, are always significantly above the minimum 
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Figure 4: Chain constraint projection applied to a monomer 
configuration (top) in the rotamer model. 



bining -EJ c hain with -Elj (thereby modifying set B), would be a 
mistake because the former has a very long-range character, in 
contrast with the latter, and the projection would almost always 
be blind to the possibility of favorable monomer contacts. Our 
solution was to combine a modified form of -E c hain with Eh.f- 

1 N ^ 

^chain = 4 22(1 - Ti-n ■ r ii+1 )w(ri- U )w(r u+ i) , (13) 

8=2 

where 

, , f 1 if r < 1 

W(r) = ( l-(l/r 2 -l) 2 ifr>l. (U) 

A modification of this kind is valid, since any solution x £ An 
B satisfies the chain constraint, and E' cheiin reduces to -©chain- 

Gradient descent to a constraint set specified by the contours 
of a function, is only distance minimizing when the constraint 
function is linear. We considered the possibility, when seeking 
a nearby point on a contour, say V(x) — Vo, that it may be 
advantageous to perform gradient descent on a "guiding func- 
tion", say G(x). The descent would still be terminated at the 
contour of the original function; the role of the guiding func- 
tion is only to minimize the length of the path to the contour. In 
the rotamer model we obtained good quality projections with- 
out the use of guiding functions. In the flexible chain model, 
however, we used the guiding function 

G(x) = £ L j(C H p;x) . (15) 

G(x) omits the chain bending energy E' chaili anc ' allows for a 
modified value of the Lennard- Jones parameter Chp. The neg- 
ative value of Chp in the model has the effect that during gradi- 
ent descent the condensed monomers may fission into separated 
H and P domains. This is avoided by giving Chp a non-negative 
value in the guiding function. 



energy of the pure packing problem (no chain constraint). Be- 
cause the inputs to projections generated by the difference map 
scheme can fall within regions where Plj diverges sharply, we 
modified the Lennard- Jones potential to have the form a — b 7y 
for separations r,j < 0.9, with a and b chosen to make and 
its first derivatives continuous. All the folds discovered by the 
algorithm have > 0.9 for all monomer pairs i and j. 

Figure 4 shows the action of the chain constraint projection, 
Pa, on a configuration of monomers in the rotamer model with 
cos a = 0.5. The H/P sequence is known only to Pa; in this 
example it is periodic with a three element motif: (HPP) 8 . 
The packing constraint, Pg, is blind to the sequence ordering 
of monomers. 

The formulation of the packing constraint set, and the com- 
putation of its projection, was somewhat different in the flexi- 
ble chain model. This example illustrates both the pitfalls in the 
naive application of the difference map algorithm, as well as its 
flexibility. The chain energy (II Q would seem to have its natural 
place in defining the chain constraint A. However, this would 
entail having to specify another adjustable energy parameter in 
addition to the packing energy Eq. The other option, of com- 



Results 

Rotamer model. A useful record of the progress of the dif- 
ference map algorithm is the time series of difference magni- 
tudes, S t = ||A(xt)||. In our folding application, S t is the rms- 
displacement (in units of the chain's bond length) of monomers 
in two configurations: one satisfying the chain constraint, the 
other satisfying the packing (energy) constraint. The algorithm 
terminates when St = 0, that is, when a valid polymer geometry 
is found with energy below the chosen target value Eo (a "fea- 
sible solution"). Figure 5 shows a difference plot with (3 = 1.2 
for the sequence (HPP)g in the rotamer model with geometry 
cos a = 0.5 and target energy Eq = —24. The behavior of St 
in the folding problem is typical of behavior observed in other 
applications|7|. The three stages of the solution process are 
evident in (1) the initial (very fast) decay during convergence 
to the quasi-attractor, (2) steady-state fluctuations as the quasi- 
attractor is searched, and (3) a final (fast) decay to zero when 
the solution (a fixed point) is discovered. As in phase retrieval, 
the distribution of run times (total iterations) is exponential! 6 1 
and consistent with the interpretation of a very fast relaxation 
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Figure 5: Evolution of the rms displacement of monomers, 5*, 
between two configurations that satisfy the chain and packing 
constraints, respectively. The fold shown in Figure 6 was found 
in just over 6000 iterations. 



of the probability distribution on the quasi-attractor. For the pa- 
rameters given, the average number of iterations per solution 
was J avc = 7500. 

The feasible solutions found by the difference map for given 
target energies Eq were refined by steepest descent minimiza- 
tion of the heteropolymer energy; the chain geometry was main- 
tained by adding the penalty function H2\ with a large multi- 
plier. For each run of the algorithm we therefore obtain one 
locally minimized fold with energy guaranteed to be below Eq. 
In the example above, about half of the outputs had the same re- 
fined energy of —25.048 and structure (or enantiomorph). Since 
this also is the lowest energy obtained, we have good reason to 
believe this is the ground state. The structure, shown in Figure 
6, resembles a cut trefoil knot. 

The most direct measure of the work performed by the al- 
gorithm is the average number of iterations per solution 7 avc , 
divided by the rate po with which the lowest energy fold (pu- 
tative ground state) is obtained. This is a number that we ex- 
pect to grow exponentially with the length of the polymer, and 
roughly corresponds to the number of conformations that must 
be sampled before one can claim to have discovered the ground 
state. For the example above, I avc /po ~ 15000. We repeated 
the above experiment with longer sequences having the same 
repeating motif. The size N = 36 is about the limit of where 
the ground state can be established with modest computing re- 
sources (a single processor). As argued below, it may be pos- 
sible to exceed this limit for "well designed" sequences. Our 
rotamer model experiments are summarized in Table 1 . 

Flexible chain model. Studies of this model by other 
investigators [8 9 1 have been limited to Fibonacci sequences F k , 
defined by 

F =K,F 1 =P, F k+1 =F k - 1 F k . (16) 

The tendency toward hydrophobic core formation is even 
stronger for the Lennard- Jones parameters of the flexible chain 
model. For the Fibonacci sequences, in particular, the chain 



Figure 6: The fold having the lowest energy for the sequence 
(HPP)8 in the rotamer model has the shape of a cut trefoil knot. 



bending energy must be sacrificed in order to allow the chain 
to weave between the hydrophobic core and polar envelope. To 
improve the packing projection we therefore used the guiding 
function dl5> . which omits the bending energy, and Chp = 
for the short chains, Chp = 0.1 for N = 55. A sign change of 
the difference map parameter /3, which effectively interchanges 
the two constraint sets, gave somewhat better results in the flex- 
ible chain model. 

Our results for Fibonacci chains up to N = 55 are summa- 
rized and compared with other algorithms in Table 2. The dif- 
ference map corroborates the ground state candidates found by 
the ELP| 1 1 1 algorithm for chains up to N = 34, and finds a 
lower energy fold for N = 55. All the best folds have a well 
developed hydrophobic core; the N = 55 chain shown in Fig- 
ure 7 is a good example. The latter fold was only obtained in 
one run, and we are therefore far from claiming to have found 
the ground state. 

Low energy folds in the flexible chain model for sequences 
containing adjacent H monomers are qualitatively different 
from the low energy folds for Fibonacci sequences, in which 
H monomers are never adjacent. A good example is provided 
by the N = 25 sequence H(HPPH)g, which was designed to 
realize a particular ground state geometry. Using the difference 
map it is easy to establish the ground state shown in Figure 8. 
This fold is unique in that it simultaneously minimizes the bend- 
ing energy where the chain passes through the icosahedral core, 
and also arranges the six hairpin turns so that the P monomers 
there form the largest number of contacts. 

Discussion 

The difference map folding algorithm was shown to be compet- 
itive with leading algorithms in experiments with model pro- 
teins. We conclude by discussing two issues that will be impor- 
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Figure 8: Ground state of the designed sequence H(HPPH)6 in 
the flexible chain model. The 13 H monomers form the vertices 
and center of an almost perfect icosahedron. Apart from the 
final bond in the structure, the chain geometry is approximately 
symmetric with respect to a 2-fold axis. 



Figure 7: Fold with lowest known energy for the N — 55 Fi- 
bonacci sequence in the flexible chain model; top: chain geom- 
etry, bottom: monomer packing. 



tant in applications to realistic protein models. 

Designed sequences. The performance of an iterative phase 
retrieval algorithm, of which the difference map folding algo- 
rithm is a logical descendent, is sensitively dependent on the 
degree to which the input data is overdetermined|7|. We be- 
lieve that the latter attribute's counterpart in protein folding is 
the property of being "well designed". 

In the context of the geometry of the difference map, a highly 
overdetermined problem corresponds to the situation where the 
probability of nonempty intersection of the constraint sets A 
and B, given a specification by random data, is exceedingly 
small. This makes the existence of a solution all the more un- 
usual. In phase retrieval one is guaranteed a solution in even 
these unlikely circumstances, and moreover, the uniqueness of 
the solution and efficiency of the solution process relies on this 
fact. 



Whether the simple protein models studied above have the 
capacity for realizing highly overdetermined problem instances 
(sequences) is open to speculation. With our choice of de- 
constructing the energy landscape into chain and packing con- 
straints, this would imply the existence of exceptionally low en- 
ergy monomer packings that nevertheless can be threaded by a 
particular sequence. Folds with these properties should be eas- 
ier to find, because the target energy Eq of the difference map 
algorithm could be set at a lower value and thereby eliminate a 
large part of the energy landscape. One experiment to test this 
hypothesis, in the flexible chain model, would be to fold ran- 
dom sequences of 13 H and 12 P monomers and compare per- 
formance, as well as ground state energies, with the designed 
sequence H(HPPH) 6 . 

More realistic models. A serious deficiency in the protein 
models studied above is the omission of the hydrogen bonding 
mechanism that acts on the peptide geometry and is responsible 
for the two distinctive types of secondary structure. Implement- 
ing this level of detail lies at the heart of applying the difference 
map algorithm to realistic models. How big a space should the 
constraints be embedded within? It is cetainly not enough, as in 
the rotamer and flexible chain models, to embed a protein of N 
residues in a space of dimension 3A^. The orientations and in- 
ternal dihedral angles of side groups require additional coordi- 
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N 


sequence 


Edm 


E 


a 


■^ave 


Po 


time/iter 


24 


(HPP) 8 


-25.048 


-24.0 


1.2 


7500 


0.50 


12 msec 


30 


(HPP)io 


-34.900 


-33.0 


1.2 


23000 


0.07 


18 


36 


(HPP) 12 


-45.851 


-42.5 


1.2 


150000 




26 



Table 1: Results for the rotamer model. Eq is the target energy of the difference map (DM) algorithm, 7 avc the average number 
of iterations to find the target energy, and po is the probability that the discovered fold refines to the lowest energy obtained in the 
experiment, -Edm- The last column gives the cpu time per iteration on a 1.67 GHz processor. 



N 


sequence 


EpERM 


Emvca 


Eelp 


Edm 


E 


P 


•*ave 


Po 


time/iter 


13 


^6 


-4.962 


-4.967 


-4.967 


-4.975 


-4.5 


-1 


34 


0.34 


3 msec 


21 


F 7 


-11.524 


-12.296 


-12.316 


-12.327 


-11.8 


-1 


2900 


0.024 


25 


34 


Fs 


-21.568 


-25.321 


-25.476 


-25.512 


-23.5 


-1 


10000 


0.007 


80 


55 


F 9 


-32.884 


-41.502 


-42.428 


-43.331 


-38.0 


-1 


27000 




200 


25 


H(HPPH) 6 








-28.313 


-27.4 


-1 


9200 


0.030 


45 



Table 2: Results for the flexible chain model. Ground state energy estimates obtained by the difference map (DM) algorithm are 
compared with three other algorithms: pruned-enriched Rosenbluth method (PERM|8|), multicanonical sampling (MUCA|9|), 
and energy landscape paving (ELP|9|). Optimal structures found by ELP and DM for N = 13, 21, and 34 Fibonacci sequences 
are essentially the same| 12 j. For N = 55 DM finds a different, lower energy fold. See Table 1 for definitions of DM parameters. 



nates in their specification. The peptide chain geometry is also 
more complicated, and with its two dihedral angles per residue 
may require twice the embedding dimension than its analog in 
the rotamer model. 

These remarks should make clear that there is no automatic 
procedure for effectively applying the difference map algorithm 
to arbitrary optimization problems. An essential part of the en- 
deavor is the formulation as a constraint satisfaction problem, 
that is, a recipe for deconstructing the energy landscape. 
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